Main
Anderson Banihirwe
I contribute to and maintain several libraries within the open source scientific Python stack, particularly around improving scalability of Python tools in order to handle terabyte-scale datasets on HPC and cloud platforms.
Education
B.S., Computer Systems Engineering
University of Arkansas at Little Rock
Little Rock, AR
2018 - 2014
Professional Experience
Software Engineer
National Center for Atmospheric Research
Boulder, CO
current - 2018-10
- Assisted with the development, implementation, and maintenance of software contributions to Open Source packages –namely dask, xarray, intake, jupyter — used by the Pangeo community as they pertain to NCAR systems or data.
- Assisted with the development and maintenance of Pangeo-related installations and deployments on premise High Performance Computing (HPC) systems or commercial cloud that access NCAR resources.
- Assisted with the development and deployment of live (virtual or in-person) and online/self-paced education material, including training websites, Jupyter notebook examples, and teaching modules.
Software Developer Intern
Quansight
Austin, TX
2018-09 - 2018-05
- Developed xndframes, a Pandas ExtensionDtype/Array backed by xnd, a container type that maps most Python values relevant for scientific computing directly to typed memory.
- Worked on integrating cuDF - GPU dataframe library with Apache Arrow library.
- Worked closely with a customer to port existing Postgres code base to a Dask based workflow.
Data Science Intern
First Orion
Little Rock, AR
2018-04 - 2017-11
- Built scoring, predictive models with Scikit-learn, Dask, and Apache Spark using First Orion’s proprietary telecommunication data.
Research Intern
National Center for Atmospheric Research
Boulder, CO
2017-08 - 2017-05
- Developed spark-xarray, a Python package that integrates PySpark and xarray for climate data analysis.
Selected Publications, Posters, and Talks
The Pangeo Ecosystem: Interactive Computing Tools for the Geosciences: Benchmarking on HPC
2019 Supercomputing Conference Workshop on Interactive High-Performance Computing
N/A
2020
- Authored with Tina Erica Odaka, Guillaume Eynard-Bontemps, Aurelien Ponte, Guillaume Maze, Kevin Paul, Jared Baker, Ryan Abernathey.
Zarr: chunked, compressed, multidimensional arrays
2020 Cloud Native Geospatial Outreach Day
Online
2020-09
- Invited talk about Zarr, an open source data format for the storage of chunked, compressed, multidimensional arrays.
- Recorded talk: https://www.youtube.com/watch?v=cOMkgQssVPk&list=PL3QzFgBMGnbQWbW-V09AzSfCbnf6Q87Rq&index=4&t=1471s
- Slides: https://talks.andersonbanihirwe.dev/zarr-cloud-native-geospatial-2020.html
Intake-ESM – Making It Easier To Consume Climate and Weather Data
2020 ESIP Summer Meeting
Online
2020-07
- Invited talk about intake-esm, an intake plugin for working with Earth System Model (ESM) datasets.
- Recorded talk: https://youtu.be/OpzF6IwIHRA?t=1400
- Slides: https://talks.andersonbanihirwe.dev/intake-esm-esip-2020.html
Intake / Pangeo Catalog: Making It Easier To Consume Earth’s Climate and Weather Data
2020 EarthCube Annual Meeting
Online
2020-06
- Contributed Jupyter notebook about Pangeo’s data cataloging efforts.
Interactive Supercomputing with Dask and Jupyter
2019 Scientific Computing with Python conference
Austin, TX
2019-07
- Contributed talk about Dask and Jupyter.
- Recorded talk: https://youtu.be/vhawO8fgD64
- Slides: https://andersonbanihirwe.dev/talks/dask-jupyter-scipy-2019.html
Beyond Matplotlib - Tutorial: Building Interactive Climate Data Visualizations with Bokeh and Friends
2018 UCAR Software Engineering Assembly conference
Boulder, CO
2018-04
- Contributed tutorial about interactive visualization with Python.
- Tutorial materials: https://github.com/andersy005/beyond-matplotlib-tutorial-sea-2018
PySpark for “Big” Atmospheric Data Analysis
Eighth Symposium on Advances in Modeling and Analysis Using Python
Austin, TX
2018-01
- Contributed talk about spark-xarray.
- Recorded Talk: https://ams.confex.com/ams/98Annual/webprogram/Paper334546.html
- Slides: https://opensky.ucar.edu/islandora/object/conference%3A3443